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Abstract 



We study the Fokker-Planck equation as the thermodynamic Hmit of a stochastic particle 
system on one hand and as a Wasserstein gradient flow on the other. We write the rate 
^<0 ' functional, which characterizes the large deviations from the thermodynamic limit, in such 

C^ ! a way that the free energy appears explicitly. Next we use this formulation via the con- 

(<— ^ I traction principle to prove that the discrete time rate functional is asymptotically equivalent 

^J • in the Gamma-convergence sense to the functional derived from the Wasserstein gradient 

(^ , discretization scheme. 

(N 



1 Introduction 



Since the seminal work of Jordan, Otto and Kinderlehrer |JK098] . it has become clear that 
there are many more partial differential equations that can be written as a gradient flow than 
previously known. Two important insights have contributed to this: the generalisation of gra- 
dient flows to metric spaces and the specific choice of the Wasserstein metric as the dissipation 
mechanism. The paper by Jordan, Kinderlehrer and Otto introduced a gradient-flow structure 
by approximation in discrete time. More recent work have shown how these ideas can be studied 
in continuous time [OttOlj . and how they can be generalised to any metric space |AGS08] . This 
paper is mainly concerned with the time-discrete scheme, which we shall now explain. 

' 2f-Dd 



A gradient flow in L (r ) is an evolution equation of the form 



dp 

— = -gradi2-F(p), (1) 



for some functional T . For a gradient flow it is natural to use the following time-discrete variational 
scheme. If po is the solution at time t = 0, then the solution at time r > is approximated by the 
minimiser of the functional 

Indeed, the Euler-Lagrange equation is then ^^~^° = —^YdAip.T{pr^, which clearly approximates 
([T]) as r — )■ 0. In the same manner, one can define a variational scheme by minimising the functional 

P^'^(P) + ^W^I(P,P0), (2) 

where W2 is the Wasserstein metric. Convergence of this variational scheme was first proven in 
(JK098| with the choice of T{p) := S{p) ^ E{p), where 

E{p) = [ nx)p{dx) and S{p) := l^-^ ^^^^ ^°^^^^^ ^^' ^°^ ^^^^^ = ^^^^ ^^ (3) 
jRd loo, otherwise, 

for some potential \E'. In this case, the minimisers converge to the solution of the Fokker-Planck 
equation 

^ = Ap + div(pVvI/). (4) 

Later, in [OttOT], this result was extended to more general J-", but we will be concerned with the 
specific choice ([3]). Physically, S can be interpreted as entropy, S as internal energy, and J-" as the 
corresponding Helmholtz free energy (if the temperature effects are hidden in \l/); hence it is not 
surprising that this free energy should decay along solutions of (j4]). However, it is not intuitively 
clear why the dissipation of free energy must be described by the Wasserstein metric. 

As we will explain in Section [3l for systems in equilibrium, the stochastic fluctuations around 
the equilibrium are characterised by a free energy similar to (|3]). Recent developments suggest a 
similar principle for systems away from equilibrium |Leo07t lADPZlOJ IPRllt IDLZlOt IADPZ12] . 
To explain this, consider A^ independent random particles in R'' with positions Xk{t), initially dis- 
tributed by some po G V(R'^), where the probability distribution of each particle evolves according 
to (JH). Define the corresponding empirical process 



: t ^ 



1 ^ 



fc=i 

Then, as a consequence of the Law of Large Numbers, at each r > the empirical measure L^qir) 
converges almost surely in the narrow topology as A^ — >■ cxo to the solution of the Fokker-Planck 
equation (jl]) with initial condition po |Dud89] : this is sometimes known as the thermodynamic limit. 
The rate of this convergence is characterised by a large deviation principle. Roughly speaking, this 
means that there exists a Jr : V{R'^) — ?■ [0, 00] such that (see Section |3]) 

Prob {Ln{t) ^ p I Ln{0) ^ Po) ~ exp (-A^ Jt{p\Po)) as A^ -)■ cx). 



In |Leo07l Prop. 3.2] and |PRlll Cor. 13], it was found that 

Jr{p\po) = inf |'H(7|po®Pr) : 7 e n(po,p)|, (5) 



where l-L is the relative entropy (discussed in Section |3]), pt is the fundamental solution of the 
Fokker-Planck equation (jl]) and n(po,p) is the set of all Borel measures in R^'^ that have first and 
second marginal po and p respectively. In this paper, we characterise a class of potentials ^ and 
initial data po for which ([5]) is equal to 



2 






•(6) 

where the || ■ ||-i,p norm and the exact meaning of grad J-" will be defined in the sequel. In the 
main theorem, by using the above equality, we show that the Wasserstein scheme |JK098] has the 
same asymptotic behavior with J^ for r — >■ 0, in terms of Gamma-convergence (see |Bra02j for an 
exposition of Gamma-convergence). 

Theorem 1.1. Let po = po{x)dx G P2(R) be absolutely continuous with respect to the Lesbegue 
measure with po{x) is bounded from below by a positive constant in every compact set. Assume 
that J-'(po), ||Apo||?.ipg and j^ |V^(x)ppo((ia:) are all finite, and that \E' G C^(r) satisfies either 
Assumption \4.1\ or 4-4 (introduced in Section^. Then we have 



Jri- \P0) - ^^^%^ ^l^^-^- l^^P^^^ ^^ ^2(^)- (7) 



Here P2(R) denotes the space of probability measures on R having finite second moment. As we 
will prove, the Gamma-convergence result holds if 7^2 (R) is equipped with the narrow topology, as 
well as if we equip it with the Wasserstein topology. More precisely: we will prove the lower bound 
in the narrow topology (Theorem lS.ip . and the existence of the recovery sequence (Theorem 16. ip in 
the Wasserstein topology. In the Wasserstein topology, the Gamma-convergence ([7]) immediately 
implies: 

rJri-\po)^\w^iPo,-) inP2(R). (8) 



For a system of Brownian particles, i.e. ^ = 0, statement (IE]) can also be found in |Leo07] . 



Together, the two statements (JTj) and ([8]) make up an asymptotic development of the rate Jr for 
small r, i.e. 

MpIpo) ~ l^p) - Ij'iPo) + ^W^ipo,p). 

Apart from the factor 1/2 and the constant J^{po), which do not affect the minimisers, this ap- 
proximation indeed corresponds to the functional defining the time-discrete variational scheme ([2]) 
from |.TK()98] . 

For \E' = 0, the main statement (171) was proven in J ADPZIO] in a subset of 7^2(1^) consisting of 
measures that are sufficiently close to a uniform distribution on a compact interval. In |PR11] . it 
was proven that whenever ([7]) holds for \E' = 0, then it also holds for any \I' G C^(R'^). Both papers 
make use of the specific form of the fundamental solution of (jl]). In |DLZ10] . ([7]) was shown for 
Gaussian measures on the real line. In our approach, using the path-wise large deviations, we can 
avoid using the fundamental solution, allowing us to prove the statement in a much more general 
context. 



All theorems in this paper also work in higher dimensions, except for the existence of the 
recovery sequence in the main theorem. This has to do with the fact that in one dimension the 
optimal transport plan between two measures with equal tails will be the identity at the tails. 
However, this argument fails in higher dimensions. We belief that the recovery sequence also 
exists in higher dimensions but this is left for future research. 

The required concepts of this paper are introduced in Section [21 In Section [3l we explain the 
concept of large deviations in the case of an equilibrium system, introduce the dynamical particle 
system that we study more precisely, and discuss the conditional large deviations for this system. 
The alternative form of the functional Jh is proven in Section H] via the path-wise large deviation 
principles. Finally, in Section |5] we prove the Gamma-convergence lower bound, and in Section [6] 
the existence of the recovery sequence. 



2 Preliminaries 



By the nature of this study, we need a combination of techniques from probability theory, 
mostly from the theory of large deviations, and from functional analysis, mostly from the gradient 
flow calculus as set out in |AGS08j . Let us introduce these concepts here. 

To begin, let us discuss the topological measure spaces. Unless otherwise stated, the space 
of probability measures 'P(R'^) will be endowed with the narrow topology, defined by convergence 
against continuous bounded test functions: 



Pi — > p as t — )■ if and only if / dpt -^ (pdp for all (p E Cfc(R 



d 



,d\ 



We sometimes identify measures with densities when possible, which is typically the case if a 
measure has finite entropy. The space 'P2(R'^) = {p € ■P(R'^) : Jlx]"^ p{dx) < cxo} will be endowed 
with the topology generated by the Wasserstein metric W2- The Wasserstein distance of two 
measures po,p G 7^2 (R"') is defined via 



l^|(po,p)= inf (/ / \x~y\'d^\ 
7en(po,p) IJ^n J^n ) 



Convergence in the Wasserstein topology can be characterised as (see e.g. |Vil03lRGS08j ): 
Pt — )■ p as t — )> if and only if (i) Pt -^ P narrowly, and 

(ii) / |a;p(ip(— 7- / {x]"^ dp. 



We write C([0, 1],'P(R'^)) for the space of narrowly continuous curves [0,1] — )■ P(R'^), and 
C(po, p) for the space of narrowly continuous curves [0, 1] — )■ V{R'^) starting in po and ending in p. 
Similarly, for Wasserstein-continuous curves in P2(R'^) we write Cvi/2([0, 1],P2(R'')) and Cw2{Po,p)- 



Furthermore, we use two different notions of absolutely continuous curves. The first notion is 
taken from |DG87l Def. 4.1]. Let V = C^{R'^) be the space of test functions with the corresponding 
topology (see |Rud73t Sect. 6.3]), let V be its dual, consisting of the associated distributions, and 
let ( , ) be the dual pairing between V and V. We will identify a measure p G 'P(R°') with a 
distribution by setting (p, /) := J f dp. Denote by Vk C V the subspace of all Schwartz functions 
with compact support K C R*^. Then a curve p(.) : [0, 1] — )■ V is said to be absolutely continuous 
in the distributional sense if for each compact set K GR'^ there is a neighborhood Uk of in Vk 
and an absolutely continuous function Gk : [0, 1] — )■ R such that 

\{Pt,J)-{Pt,J)\<\GK{t2)-GK{tl)l 

for all < ti, ^2 < 1 and / G Uk- We denote by AC([0, 1]; V) the set of all absolutely continuous 
maps in distributional sense. Note that if a map p(.) : [0, 1] — )■ V is absolutely continuous then the 
derivative in the distributional sense pt = liniT-^o ^{Pt+r ~ Pt) exists for almost all t G [0, 1]. 

Secondly, we say a curve p(.) : [0, 1] — ?■ V^iR^^ is absolutely continuous in the Wasserstein sense 
if there exists a (? G L^{0, 1) such that 

*2 



W2ipt„Pt,)< f 9{t)dt 



for all < ti < t2 < 1 (see for example |AGS08j ). We denote the set of absolutely continuous 
curves as ACvy2([0, l];p2(R'^))- 



LHpt) 



For an absolutely continuous curve p(.) there is a unique Borel field Vt ^ V := {Vp : p G V} 
such that the continuity equation holds tAGSOSl Th. 8.3.1]: 

— — h div(pit>t) = in distributional sense. (9) 

This motivates the identification of the tangent spacqj of P2(R'^) at p with all s E T>' for which 
there exists a v E V such that 

s + div(pt') = in distributional sense. (10) 

The following inner product on the tangent space at p is the metric tensor corresponding to the 
Wasserstein metric |Ott01] 

(Sl,S2)_l,p := 77 / vi ■V2dp, 

where Vi and V2 are associated with si and S2 through (TTOj) . The corresponding norm coincides 
with the dual operator norm on V 



\s\\\^ := sup \{s,p) -\ I \Wp\''dp\ . (11) 



This norm is closely related to the Wasserstein metric through the Benamou-Brenier formula 
[BBOO] 



W2{po, pi)'^ = min I / ||-— ^H^.^^^^ c/t : pt|i=o = Po and pt|i=i = pi L 



(12) 



^Here we like to point out that in [AG SO 8) the tangent space is identified with the set of velocity fields V. 



Observe that, in approximation, any small perturbation pt from a p G P2(R°') can be specified 
by a potential p G P such that ([9]) holds with po = p and v = Vp. Following |FK06l Definition 
9.36], for any J^: V{R'^) — ?■ [—00, +00], we write, if it exists, gradJ-'(p) for the unique element in 
V such that for each p eV and each p(.) : [0, 00) — )■ V(R'^) satisfying ([9]) with po = p and v = Vp, 
we have 

lim = (grad J-'(p), »). 

Let J^{p) = £{p) + S{p) be the free energy defined as in ([3]). By |FK06[ Theorem D.28], if 
J-'(p) < 00, then 

gradj^(p) = -(Ap + div(pV^)) in V'{r'^). 

The following functional plays a central role in this paper 

||Api|% =1^-''^^^ if p(rfx)=p(x)rfa: and ypG if HR'^), ^^3^ 

'^ 1 00 otherwise, 

where Vp is the distributional derivative of p. This functional is also known as the Fisher infor- 
mation. 

We conclude this section with two results that we will need. 

Lemma 2.1. \A GS081 Th. 8.3.1] Let p(,) : (0, r) — >■ P(R'^) he a narrowly continuous curve and let 
i)(.) : (0,r) -^ V he a vector field such that the continuity equation holds. If 



Po € P2(R'^) and I \\Vt\\^2( \dt < cxD 



(14) 



then Pt G 'P2{'R'^) for all < t < t and p(.) is ahsolutely continuous in the Wasserstein sense. 

Remark 2.2. We point out that the hypothesis in Lemma \2.1\ requires a priori that the curve p(.) 
lies in V2{'R'^), hut the proof actually shows that the condition iHM implies the whole curve to he 
in 7^2 (R'^) (and it is ahsolutely continuous in the Wasserstein sense). 

Lemma 2.3. Assume that p(^.y. (0, r) — )■ V2{R'^) is a Wasserstein-ahsolutely continuous curve. 

1. If ^ E C^(R'^) is convex, hounded from helow, and it satisfies the conditions 

£{pt) <oo VtG[0,r] and / / \V^!{x)\^ pt{x) dxdt < +00, 

Jo Jr'* 

then t H- £{pt) is ahsolutely continuous. 

2. If 

S{pt) < 00 Vi G [0,r] and / \\Apt\Wp^ dt < 00, 

Jo 

then t h-)> S{pt) is ahsolutely continuous. 

If the conditions in hoth parts are satisfied, then grad J-'(pt) exists and the following chain 
rule holds 

— J'(pt) = (gradJ'(pt),— Pi ) . (15) 



dt "^'' V" "^''' dt' , ,, 



Proof. This Lemma is a direct consequence of JXGS081 Th. 10.3.18]. Since the functionals £^(p) and 
S{p) are lower semicontinuous and geodesically convex, we only need to check condition |AGS08 



10.1.17]. This condition in turn is satisfied by the Cauchy-Schwartz inequality {f,g)L^(a,b) ^ 
L'^{a,b)\\g\\L'^(a,b) a^d the assumptious. D 



3 Particle system and conditional large deviations 



In this section we first explain the concept of large deviations with a simple model particle 
system. Then, we introduce the dynamic particle system that we study more precisely, and discuss 
the large deviation principle for this system. 

Consider a system of independent random particles in R'' (without dynamics), where the po- 
sitions Xi, . . . ,Xn are identically distributed with law po- Then as a consequence of the law of 
large numbers L^ — )■ Po almost surely in the narrow topology as N ^ oo |Dud89l Th. 11.4.1]. 
Naturally, this implies weak convergence: 

lim Prob(L7vGC) = 6p,{C) 

for all continuity sets C C 'P(R'^) in the narrow topology. A large deviation principle quantifies 
the exponential rate of convergence to (or 1). More precisely, we say the system satisfies a large 
deviation principle in V{R'^) with (unique) rate J : V{R'^) — )■ [0, oo] if J is lower semicontinuous, 
and for all sets U C P(R'^) there holds (see, for example |DZ87] ) 

— inf J < lim inf — log Prob(LAr G f/°) < lim sup — log Prob(LAf G f/) < — inf J. 

In addition, we say a rate functional is good if it has compact sub-level sets. By Sanov's Theorem 
[DZ87[ Th. 6.2.10], our model example indeed satisfies a large deviation principle, where the good 
rate functional J{p) is the relative entropy 

H(p|p»):=^>S<*)*' "''«''°' (16) 

cxD, otherwise. 

In this example we see the (relative) entropy appearing naturally from a limit of a simple particle 

system. 

Let us now consider our particle system with dynamics, and study its Sanov-type large de- 
viations. To define the system more precisely, let Xi(t),--- ,XAr(t) be a sequence of indepen- 
dent random processes in R"^. Assume that the initial values are fixed deterministically by some 
Xi(0) = Xi, . . . Xjv(O) = xn in such a way that g 

-^Af(O) — )■ Po narrowly for some given pQ E V{R'^). (17) 



^The reason behind this specific initial condition is that we want to somehow condition on Ljv = P, which is a 
measure-0 set. 



The evolution of the system is prescribed by the same transition probabihty for each parti- 
cle Prob(Xfc(t) G dy\Xk{0) = x) = pt{dy\x). Naturally, for such probability there must hold 
Pt{dy\x) — )■ 5x{dy) narrowly as t — )■ 0, and it should evolve according to (j4]). We thus define pt to 
be the fundamental solution of (jl]) q 

Again by the law of large numbers Ln{t) — )■ pr almost surely in 'P(R'^), where pr = po * Pt, 
the solution of (jl]) at time r with initial condition po- In addition, the empirical measure L]^{t) 
satisfies a large deviation principle 

Prob {Ln{t) ^ p) ~ exp (— A^J^(p|po)) as A^ — )■ oo. 

with good rate functional (|5]). Observe that Jr( • |po) > is minimised by po* Pr- 

4 Large deviations of trajectories 

In this section we prove, under suitable assumptions for po and \I/, the equivalence of the rate 
functionals i^ and (jH])- The latter form will be used to prove the main Gamma convergence 
theorem. First, the large deviations of the empirical process is derived. To this aim we will need to 
distinguish between two different types of potentials ^. Next, we transform these large deviation 
principles back to the large deviations of the empirical measure Ln{t) by a contraction principle, 
and finally show that the resulting rate functionals are the same for both cases. 

In the first case we consider potentials that satisfy the following 
Assumption 4.1 (The subquadratic case). Let ^> G C^(R°') such that 

1. '^ is hounded from below, 

2. there is a C > such that |x||V\I'(x)| < C(l + |xp) for all x G R"', 

3. \E' is convex, 

4- A\E' is bounded. 



Note that the second assumption indeed implies |^E'(a;)| < C(l + \x\'^). Under Assumption 14. ![ 
combined with initial condition f lT7|) . the empirical process {-£/Af(^)}o<t<T satisfies a large deviation 
principle in C([0, r],P(R'^)) with good rate functional |D(;87| Th. 4.5] 

j^^p )^f UoWt-^Pt-^^iPt'^nW.dt, if p(.)GAC([0,r];2)'), ^^g^ 

I oo, otherwise. 



■^Equivalently, we can define the dynamics of Xi, . . . , Xj^ by tfie Ito stochastic equations 

dXk{t) = -V^iXkit)) dt + V2dWk{t), k^l,--- ,N 
where Wi, . . . , W^ are independent Wiener processes. 



It follows from a contraction principle |DZ87l Th. 4.2.1] and a change of variables 1 1— t- t/r that 
MpIpo) = inf — / 



^ ^ ^-r(Api + div(p,VM/)) dt. (19) 



Remark 4.2. The first assumption guarantees that the functional £ : 'P(R'^) — )■ (— oo, oo] is well 
defined. The last two assumptions are not necessary to derive fllSI) .' however we will need them in 
the sequel. Especially the last one is a technical assumption that we will need in Lemma \4^ It 
can be relaxed in several ways, but for simplicity we chose not to. 



curve 



Remark 4.3. In 1^^ we implicitly set ^ J^ Waf- ^{^Pt + div(ptV^))||?.i ^^ dt = oo if the 

is not absolutely continuous in distributional sense. Therefore, from now on, we shall only consider 

curves in C{po,p) or Cw2{po, p) that are absolutely continuous in distributional sense. 



In the second case we require a combination of assumptions on \1/ that were taken from |FK06j 
and |FNllj : 

Assumption 4.4 (The superquadratic case). Let \1/ G C^(R'^) such that: 



1. There is some A.^ G R such that z^D'^'^{x)z > A* |z| for all x,z ^ R'^; 

2. /j^,*(x)e-2*Wrfx< cx); 

3. \1/ has superquadratic growth at infinity, i.e. lim|^|_^oo i^p = +oo; 

4. There exists an u & C*(R+) '"'^^^ '^(O) = such that for all x,y ^ R'^ 

^{y) - ^(x) < uj{\y - x|)(l + ^(x)), 
|*(l/) - ^(x)p < uj{\y - x|)(l + |V^(x)p + ^(x)); 

5. ( := |V\E'p — 2A\1' has superquadratic growth at infinity, i.e. lim|^|_^oo x^ = +00; 

6. There is some A^ G R such that z'^D'^({x)z > A^ \z\'' for all x,z E R'^. 



Whenever Assumption 14.41 and initial condition flT7|) hold, then by |FK06[ Th. 13.37] the 
process {Ljv(t)}o<i<r satisfies a large deviation principle in Cvi/2([0, r],P2(R'^)) with good rate 
functional (fTSll . 

Remark 4.5. Contrary to the subquadratic case, the latter is actually a large deviation principle 
on the set of all continuous paths in 7^2 (R"') with respect to the Wasserstein topology. Although we 
strongly believe that this is also true for the subquadratic case, it is very difficult to prove due to 
the fact that the functional Jr does not have Wasserstein-compact sub-level sets, and therefore it 
can't be a good rate functional in Cvk2([0, t],P2(R'^)) when \1' is subquadratic. 



Again, by a contraction principle and a simple change of variables, it follows from ( 1181) that 
must be equal to: 



Jt{p\Po) = inf — 



dpi 
dt 



-{Apt + div(piV^)) 



dt. 



(20) 



-i,Pt 



Observe that in this case the infimum is taken over Wasserstein-continuous curves, while in the 
subquadratic case (IT9l) the infimum was over narrowly continuous curves. However, we will prove 
that under the extra assumption that po ^ 'P2{,^'^) and J^(po) is finite, even in the subquadratic 
case the infimum can be taken over Cw2{PoyP)- Actually, we will prove something even stronger, 
that we will need in the sequel, namely the following: 



Proposition 4.6. Let ^ E C^(R°') satisfy Assumption \4.1\ Let pq E P2(R'^) 'with J^(po) < C)0, and 
assume p(.) E C{po,p) with Jr{p{.)) finite. Then pt E V2{^^) for every t. Furthermore, the curve 
Pi^.) lies in ACvk2 ([0; 1]) ^2(R'^)) and J-'{pt) is absolutely continuous with respect to t. Finally there 
holds: 



1 



dpi 
dt 



- r{Apt + div(piV^)) 



1 

47 



dpt 



dt 



dt 

-i,Pt 

2 

T 

dt + - 
4 J, 



/ ||gradJ^(pi)||^i_^^ dt + 2-^(Pi) 



Ihpo] 



Before we prove this theorem we prove two auxiliary lemmas. 
Lemma 4.7. Assume that 



i. \E' G C^(R'^) satisfies Assumption \4.1 , 

2. J'^poidx) < oo, 

3. p(.) gC(po,p), 
4- Mp(-)) < oo. 



Then 



Jr''- 



\V^{x)\'^Pt{dx)dt < oo. 



(21) 



Proof. For simplicity we take r = 1. We will prove the following statement: there exist < 6 < 1 
and a, P > that depend only on \E' such that 



a sup / I \I' I dpt + (3 

tG[0,<5] jRd 



4 



~'R<* 



V^rc/pic/t<8Ji(p(.)) + -|inf^| + - / ^t/po + — IIA^IIoo. (22) 



26. 



e ./od 



10 



Obviously ( 12T1) follows from (!22l) by repeating it 1/5 times. 



We will approximate \E' by a sequence of C^(R'^) functions which are allowed in the definition 
he norm || • ||_i. To account for the compact support we use the usual bump function: 



of the norm 



0, Ixl > 1. 



(24) 



Define rik{x) := r]{x/k). Then the following estimates hold 

\Vk{x)\ < 1/e, |Vr/fc(x)| < - and \Ar]k{x)\ < -pHA^^Hoo < oo. (23) 

Since rjk^ eT> the rate functional ( ITSl) is bounded from below by 

4^1 (P{.)) = / sup ({dtpt - Apt - div(piV^),p) -\ f \Vp\^dpt\ dt 
Jo pev V -^ JRd / 

> f ({dtPt - Apt - div(ptV^),r/fc^) -If |V(r/fc^)prfA j dt 

for any s G [0, 1]. We now estimate each term in the right-hand side of (^^. For the first term, 
we have 

/ {dtPt,Vk'^)dt= / rik'^dp,- / r/fe^rfpo> / r^fc]*] rfp, - -| inf ^| - / r]k^ dpo. (25) 
Jo JpA Jr'' jRd e Jp^d 

For the second part, we find 

- / {Apt,r]k^)dt = - / / {mAr]k + 2Vr]k-V^ + yikA^)dptdt 
Jo Jo Jr'^ 

>- [ [ (|A%||^| + |Vr7fc|(|V^P + l) + r/fc|A^|)dpidt 
Jo Jr'' 

' / (h\Arj\U^\ + 7(|V^P + 1) + -|A^l) dptdt 



> - 



> -^||Ar/|U sup / \^\dpt-jff \V^\^dptdt-j--\\A^\ 

^ telO,s] J [-k,k]'i- ^ Jo J[-k,k]'i- ^ 6 



(26) 



11 



Finally, for the last part 

1 



dw{ptV^),r]k^) -^ I \V{vk^)\'dpt ) dt 



CSJ/'^/'/l 21 3 \ 



Jo J[~k,kr V 2P 'A; "2 '4 



(27) 



5sC(i + p) /• ,^,^ r /■ //3 1 

\- '- sup / \^\dpt+ / / -- 



>- ^"^'^;,' '^ ' sup / |^|rfpi+ / / ( ( ^r/fc-^ ) iV^n^picit, 



where the fourth line follows from Young's inequality, and in the fifth line we used the subquadratic 
assumption. Substituting (ES]), ([26]) and ([27]) into ([24]) we get 

/ 7]kMdps+ / / -77fc|V^|^rfpidt<4Ji(p(.)) + -|inf^|+ / r/fc^rfpo + f + -||A^||oo 
jRd Jo J[-fc,fc]d4 e J^d He 

+ {U\^^\\oo+^-^^^^^) sup / rndPt+iWr) f I W^'dp^dt. 

If we first discard the first term on the left-hand side and maximise the equation over s G [0,S] for 
some < 6 < 1, then discard the second term and maximise, the sum of the inequalities can be 
written as 

<8Ji(p(.)) + -|inf^|+2 / ,^,^dpo + - + -||A^|U. (28) 
e Jp^d K e 

Taking the supremum over /c > 1, the inequality ( l28l) becomes 

(--55C)sup I mdpt+{^-\) ! [ \V^\^dptdt 
^ , te[o,5] jRd 4e ^ 4^ Jq iR<* 

:=/3 

< 8Ji(p(.)) + -|inf^| + 2supi / r/fc^rfpol +2(5 + — IIA^Iloo 
e A: URd J e 

<8Ji(p(.)) + -|inf^| + - / ^rfpo + 2(5+— IIA^Iloo, 
e e jRd e 



as snpi. Jrjk'^ dpo < snp,. Jr]k\'^\dpQ < 1/e J\^\dpQ < l/ej{'^ + 2| inf \I^|) (ipg. Take 5 such that 
a > 0. Now that we know that the suprema are finite, we can take the limit /c — )■ oo of (125]) . which 
proves ([22]). D 
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The second auxiliary lemma is: 



d _l^|2 



Lemma 4.8. Let e > and p{x) dx E V{R^) be given. Let 9{x) := [^) ^ e^^~ he the density of 
the d-dimensional normal distribution. We define 6^{x) := e~'^9{-) and p^ := p * 9^. Then there 
exists a constant C^ that depends only on e such that \\^{Pe)\\-i^p^ < C^. 



Proof. We have 

Vpe{x) = {p*Ve^){x)= j p{x-y)V9,{y)dy= -e^'^ p{x - y)y9,{y) dy. 

Furthermore 

|Vp,(x)p < e""^ / p{x-y)\y\^9,{y)dy j p{x - y)9^{y) dy < e~'^pe{x) p{x - y)\y\^9^{y) dy. 

jR<i jR<i jRd 

Now 



l|A(P.)ll-i,p. 



|Vp.(a:)P 



dx<e^l I p{x-y)\y\^9^{y)dydx 

iRd jRd 

-4 / / „/_ „A J_ |„.|2/ 



Rd J-p^d 



p{x-y)dx\y\ 9,{y) dy 



<e-' [ \y\%{y)dy:=C,. 

jRd 



We are now ready to proceed with the 



D 



Proof of Proposition \4.6\ Let p(.) satisfy the assumptions (of Proposition I4l6|) . By Lemma [4. 71 we 
have 



^R 



and therefore 



1 f\,8Pt 



Tri lllF--^'"ll--*<^ 



2r 



\V'^{x)\^pt{dx)dt < oo 



^-r(Ap, + div(p,Vvl/)) 
ot 



dt 



hPt 
1 



+ - / / \V'^ypt{dx)dt < cx). 

2 ./o jRd 



Take a < s < L Since 



At 



dpt . 
-dt - ^^P^ 



dt < oo 



(29) 



-i,Pt 



we have that W-^ — rAptW^i^p^ < oo for almost every t. By |FK06l Lem. D.34] there is a i^t G L'^{pt 



such that 



dpt 
dt 



-Apt = - div {vtpt) 
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in distributional sense. Take 9e{x) as in Lemma [4.81 Then we have 



dt 



where 



pt,e= Pt*Oeix), Vt 



rApt^, = - div {Vt,ePt,e), 

(VtPt) *Oe{x) 



Pu 



By |AGS08[ Th. 8.1.9] we have 



. , — ?^ TApteW 1 dt = 



\^tA\v^{pt,e) 



dt< — 



4^/^ :I^*IIl^(pO 



dt 



At 



dpt . 



dt. (30) 



-i,pt 



Furthermore by Lemma [4.81 we have that 

f\\Apa\.,,jt<c, 

Jo 

\\dpt,e\\2 



and therefore 



dt "-I'p*. 



dt < oo. 



(31) 



(32) 



From 0311) and since p(0) G P2(R'^)) by using |FK06[ Lem. D.34] and Lemma [2.11 we get that the 
curve pt^e is absolutely continuous in V2{'R'^)- In addition, it is a straightforward that S{pt^e) is 
finite for every < t < s. From ([21]), (I32D and by Lemma [231 <S{pt,e) is absolutely continuous with 
respect to t. Hence we obtain 



1 '^ 

4r, 
1 

"47^0 
1 

~ 47 






I (9p 



t,e l|2 



at 







i,Pi'^^ + I / ll^aJ-i,p,^^ + ^ y f grad5(pi,,), 

I^Pm||2 j. , ^ Tiia I|2 , ^c^ a ^c^ a 



dt 



dt 



-i,pt 



It follows that 



Now letting e go to zero and by the lower semicontinuity of the entropy and the Fisher information 
functionals we get S{ps) < c>o and f^ || Api||^;^^^^ dt < oo. Therefore 



\dfH\\2 

I Qt \\-l,Pt 



dt<2 



\^Pt A l|2 J+ , 



^Pt\\-iptdt < oo 



and 
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^' II Ap, + divptV^Wjt < 2 (^^' II Ap,f_^^^^ dt + ^' £ I Vvl/ 



(x)pp((a;) (ixrft ) < oo. 



By Lemma ITT] and Lemma 1^751 again, the curve pt is in AC\y2 ([0; 1]) '^2(R'^))- Moreover, t h-> J-'(p( 
is absolutely continuous and (TT31) holds. Hence we have 



4r 



^-r(Ap, + div(p,Vvl/)) 



dt 



-i,Pt 



^rJo 



This finishes the proof of the Lemma. 



I' ^ dt + ^'||Ap, + div(p,VvI/))f_,_^^ dt + ^Tip^)-\Tipo). 



D 



Remark 4.9. For the superquadratic case, the above lemma was proved by Feng and Nguyen in 
IFNl If by using probabilistic tools. In addition, they obtain an estimate for the growth of J^ along 
the curves. 



Now the following is a straightforward result: 



Corollary 4.10. Let po G 7^2 (R) with J'(po) < oo. //\1' G C (R ) satisfies either Assumption^J_ 



or 



Mth 



en 



MpIpo) = inf 

p(.)&Cw2{Po,P) 4r JO 



^ '"'^-g-r{Ap, + dW{p,Vn\L,,,dt. 



5 Lower bound 



In this section we prove the lower bound of the Gamma convergence (jTj) in our main result, 
Theorem 11.11 



Theorem 5.1 (Lower bound). Under the assumptions of Theorem \1.1\ we have for any pi G 
P2(R'^) o,nd all sequences pi G P2(R'^) narrowly converging to pi 



hminf [ MpHpo) - ^^i^) > 1.F(pO - i^(po). 



(33) 



Proof. Take any sequence pi G 7^2 (R'^) narrowly converging to a pi G V2{'R'^)- We only need to 
consider those p[ for which Jr(Pi|po) < oo. For each such p[, by the definition of infimum there 
exists a curve p[ G C(po, pi) satisfying 



1 

47 



11^ - r(ApI + div(pl "^nlW^r dt < MpHpo) + r < oo. 



(34) 
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By Lemma US] for the subquadratic case and |FNlll Lem. 2.6] for the superquadratic case, we 
have 






4r.o 



= liHpl) - Hpo)) + ^l' ll^ll-i.pr ^^ + Z /' II ^'^^J'(Pt))\Wpi dt 
>\{Hpl)'Hp^)) + ^wi{p,,pi). 

In the last inequahty above we have used the Benamou-Brenier formula for the Wasserstein distance 
[BBOO] . Finally, using p[ — )■ pi narrowly with the narrow lower semi-continuity of J-", we find that 

hminf (j.(p[|po) - M^^ > l^(p,) _ 1^(^„). 

D 



6 Recovery sequence 



In this section we prove the upper bound of the Gamma convergence ([7]). This will conclude 
the proof of Theorem 11.11 

Theorem 6.1 (Recovery sequence). Under the assumptions of Theorem \1.1\ for any pi G 'P2(R) 
there exists a sequence pi G 7^2 (R) converging to pi in the Wasserstein metric such that 

hmsup (upWpo) - S^l^'j < U{pi) - \s{p,). (35) 



As mentioned in Section [H our approach for the recovery sequence only works for d = 1. Hence 
throughout this section, we will consider d = 1. 

The existence of the recovery sequence is proven by making use of the following denseness 
argument, which is also interesting in its owiu: 

Proposition 6.2. Let (X, rf) he a metric space and let Q he a dense suhset of X. If {Kn,n G n} 
and K^o are functions from X to R such that: 



*A more or less similar idea can be found in [Bra021 Remark 1.29]; Proposition 16.21 is slightly stronger. 
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(a) Kn{q) -^ Koo{q) for all q eQ, 

(b) for every x E X there exists a sequence qn E Q with g„ — )■ a; and Koo{qn) -^ Koo{x), 

then for every x E X there exists a sequence r„ G Q, with r„ — >■ x such that Knir'n) — )■ Kao{x). 

Proof. The proof is by a diagonal argument. Take any x E X and take the corresponding sequence 
g„, — !■ X such that Koo{qn) -> Koo{x). By assumption, for any q E Q and L > there exists a ni^q 
such that for any n > ra^g there holds d{Kn{q),Koo{q)) < l/L. Define 

{1, I<n<n2,<j2, 
2, ^2,52 <n< max{n2,g2,n3,g3}, 

Take the subsequence r„ := qi^. Observe that /„ — )■ oo as n — )■ oo such that indeed qi^ — > x, and: 
d{Kn{qiJ,K^{x)) < d{Kr,{qiJ,K^{qi„))+d{K^{qiJ,K^{x))^0. 



^^ 



D 



For a fixed po satisfying the assumptions of Theorem II. ![ we want to apply Proposition 16.21 to 
the situation where 

X=P2(R), 

Q = Q{po) = \p = p{x)dx E 7^2 (R) : pix) is bounded from below by a positive constant in every compact set 

J-'(p), ||Ap||?_i p, / |V\E'(x)|^p(x) dx < oo, and there exists a M > such that 

po{x) = p{x) for all |x| > M k 

Knip) = JhAp Ipo) ^ — , where hn an arbitrary sequence converging to zero, 

Ahn 

Kooip) = I^P) - l^Po). 



Assumption (a) of Proposition 16.21 i.e. pointwise convergence for every pi E Q{po), can be 
proven as follows. Take pi E Q{po) and let pt be the geodesic that connects po and pi. In the 
following Lemma I6l3| we will prove that || Ap(||^^ and J^ {V"^ {x)\'^ pt{x)dx are uniformly bounded, 
so that we have 



dpi 
dt 



r{Apt + div(piV^)) 



dt 



-i,Pt 



< 3 / ^ dt + 3T^ [ \\Apt\W^^ dt + 3r2 / II div(p,Vvl/)f_,^^^ dt < oo. 

^0 C*^ -l,pt ^0 ^0 
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By applying Lemma SSI for the subquadratic case or jFNlll Lem. 2.6] for the superquadratic case: 



lim(J.(pi|po)-^S(^)<lin, 



r / Y / (i^tipf ^ |Vvl/(x)|Vi(x) ]dx]dt 



\Jr V Pt[x 



+ Ij'ipi) - l^Po) 



2-^(^0 - 2'^iPo)- 



The pointwise convergence then fohows from this together with the fower bound (!33|l . 

To prove the uniform bounds: 

Lemma 6.3. Let \1/ G C^(r) be convex. Let po = p{x)dx G 'P2(R) ^e asolutely continuous with 
respect to the Lesbegue measure, where p{x) is bounded from below by a positive constant in every 
compact set. Let pi G Q{po) and pt be the geodesic that connects po and pi. Assume that J-'{po) , 
II^Po||-i,po (^"i^d J^ \'V'^{x)\'^pQ{x)dx are all finite. ThenJ^{pt), ||Apt||^^^^^ ^nd J^ \'V'^{x)\'^pt{x) dx 
are uniformly bounded with respect to t. 



Proof. Let T{x) be the optimal map that transports po{dx) to pi{dx). The geodesic that connects 
Po and pi is defined by 

pt{x) = ((1 - t)x + tT{x))^po{x). 

First we prove that ||Apj||^^ is uniformly bounded with respect to t. In the real line, the map 
T(x) can be determined via the cumulative distribution functions as follows |Vil03t Section 2.2]). 
Let F{x) and G{x) be respectively the cumulative distribution functions oi p{dx) and pi{dx), i.e. 

/X PX 

Po{x) dx; G{x) = I pi{x)dx. 
-OO J —OQ 

Then T = 0'^ oF. We have 

F{M) + / po(x) dx = G{M) + / pi{x) dx = 1. (36) 

Jm Jm 

From (136|) and by the assumption that po{x) = pi{x) for all |x| > M we find that F{M) = G{M). 
Hence for all x such that Ixl > M we have 



F{x) = F{M) + / po(x) dx = G{M) + / pi(x) dx = G{x). 
Jm Jm 

Consequentially, for all x with |x| > M we have T{x) = {G^^ o F){x) = x. Therefore T'{x) = 1 for 
all I a; I > M. This, together with the fact that T is a C^ function, implies that T'{x) is bounded. 
Moreover T{x) satisfies the Monge - Ampere equation. 

Poix) = pi{T{x))T\x). 

or equivalently (since pi{x) > 0), 
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Since the densities po,Pi are absolutely continuous (recall that ^/p^, ^/p^ G H^{r)) and T'{x) in 
C^ and strictly positive, we get 

T"(x) 

= (log(po(x))-log(pi(T(x)))' 
_ p- (x) p[{T{x))r{x) 
Po{x) pi{T{x)) 

Set Tt{x) =tx + {l- t)T{x). For < t < 1 we have 

pt{x)=pr(Tt{x))T[{x), (38) 

Since pi{Tt{x)) and T/(a;) are both absolutely continuous so is pt(x). Hence the derivative appeared 
in (IT3l) for ||Api||^]^^^^ is the classical derivative. Substituting (l38l) into (IT3l) we get 

(P;(^))' ^^ ^ /• [(pi(T,(x))T/(x))f ^^ 



Pt(a;) 7r Pi(Ti(a;))T/(x) 



R 



[p',mx))n{xf+p,mx))Ti\x)f 

Pi(T,(x))r/(x) 



< . /• (pUZiMlMMl! ^ I g /• (Pim(x))T;-(x))^ 

- A Pi(T,(x))r/(x) ^ A Pi(T,(x))T/(x) 

^2 /M^Mll!(r;(a;))3rfx + 2 I Pr{T,{x))^^ dx (39) 

Note that in the inequality above we have used the Cauchy - Schwarz inequality (a+6)^ < 2(a^+6^). 
To proceed we will estimate each term in the right hand side of 0391) using the fact that |T'(x)| is 
bounded and ||Apo||^i p^, ||Api||^]^p^ < oo. For the first part we have 

R Pi{Tt{x)) J^ Pi{Tt{x)) 

A Pi{Tt{x)) 



R Pl[X) 

C'\\Ap,\\'. (40) 



'ill-i,Pi 
Let B be the ball of radius M centered at the origin. Since T"{x) = for all |a;| > M we can 
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restrict our calculation for the second part in the ball B. 

Pi{T,{x))^!:ff dx = [ Pim{x))^^dx 

pAUx) r '.^ " dx 



B PoK^) V Po\^) 



B 



Pr(T{x)) V PiiT{x)) 



'\x\>M 

|2 



<C j pt{x)dx+ / \V'^{x)\yi{x)dx 

'b J\x\>M 



<2 f p,mx)mx)(p!^y dx 
Jb \Po{x)J 

+ 2/p.(r.W)r;w(^l(I(|M£))^,, 

Jb V Pi{T{x)) J 

2 Pim{x))T;{x) ( {p',{x)r ^^ 



\Jb Po{x) Jb Pi{T{x)) 

<C(||Apof_i,po + ||Apif_i,,J. (41) 

From (1391), (iO]) and dUD we find that 

It remains to prove the boundedness of the functional j^ \W'^{x)\'^pt{x)dx. 

Since T{x) = x for |x| > M we have Pi(x) = pi{x) for |x| > M. Hence 

/ \V^{x)\'^pt{x)dx = / |V^(x)|Vt(a;)c?x+ / \V^{x)\'^pt{x) dx 
Jr Jb J\x\>m 

= / \V^{x)\'^pt{x)dx+ / |V^(x)|Vi(a^)c?3; 
Jb J\x\>m 



<C + I |V^(a;)| V(a;) rfa^ < oo. 
Finally the result for J-'{pt) comes from the fact that J-" is geodesically convex. D 



Finally, to prove assumption (b) of Proposition I6.2[ i.e. the existence of the recovery sequence 
in the dense set. 
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Lemma 6.4. Let po, pi G P2(R) o,nd "^ G C^(r) with \I/(x) > —A — i?|a;p for some positive 
constants (this includes both our cases). Assume that po is bounded from below by a positive 



constant in every compact set and that J-'(po)) II^Pol 



-l,Po 



an 



d J^„ |V\l/(x)ppi(x) dx are all finite. 



Then, there exists a sequence fc„ G Q{po) such that kn — )■ pi with respect to Wasserstein distance, 
and F{kn) — )■ J^{pi)- 



Proof. We will assume that J^(pi) < oo, otherwise the construction is trivial. Let n G N. Since 
J^ Po{x)x'^ dx < cxD and J^ po^x)]"^ {x)\ dx < oo there is a set Ai of finite Lebesgue measure such 
that for every x E Ai we have that po{x) < min{ ^,^^. ,, , ^}. Similarly there is a set A2 of finite 

Lebesgue measure such that for every x G A2 we have that pi(x) < m.m{ ,^,.^ , j^}. We can even 
ask for A2 to contain only Lebesgue points of pi to compensate for the lack of continuity. 



Let M„ > 1 with M„ G Ai fl A2 such that 

Pi{x) + \pi{x) logpj(x)| + pi(x)x^ + pi(x)|\E'(x)| 



/. 



B=(0,M„) 



dx < —, "i = 1, 2. 



Let 6^ be as in Lemma 14.81 By the theory of mollifications there is a ^^(n) that satisfies the 
following 



L(o>f„) |((Pi'" * ^<n))(x) - pi(x))^(x)| (/x < i, 

/iJ(0,A/„)(pf" * ^.(n))(a;) log(pf" * Oe{n)){x) - Pi(x) logpi(x) dx 

(pf"*^,(„))(M„)<min{^^^,i}, 



< 



(Pl *^e(n))(x)>0, VxG5(0,M„ 



where 



,M„ 



Pi 



Pi(x) if \x\ < Mn 
if Ixl > Mn. 



Since '^{x) is continuous, there is a < a < 1 such that for x G [— M„ — a, — M„ + a] U [M„ — 
a,M„ + a] we have po(M„) < min{^^^^, ;^} and (pf" * ^e(n))(M„) < minj^^^^^, ^}. Now 
define 



9l,n{x) 



f(pf"*^.(n))(x) if |x|<M„, 

(Pf" * ^.(n))(M„)(^^M^)2 if M„ < X < M„ + a, 

(pf"*^.(n))(M„)(^^±Ma±«)2 if _M„-a<x<-M„ 

if Ixl >Mn + a, 
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and 



92,n[^) 



(0 

Po(M„)(^)2 



if |x| < Mn — a, 

if Mn- a < X < Mn, 

if - Mn<x < -Mn + a, 

if \x\ > Mn. 



Also, 



It is easy to check that ||A(7j^„||^^^g, ^|f| and / f7i,„|V\l/p are finite for each i = 1,2 and n E N. 



S{g 



l,n) 



{gi,nix) hggi^nix) - pi{x) log(pi(s)) dx+ pi{x) log(pi(x)) dx 

B(0,Mn) JB{0,Mn) 



+ I gi,n{^) ^og{gi,n{x)) dx -)■ S{pi] 

' Mn<\x\<M„+a 



as n — )• oo. 



by 



Furthermore, by construction we have ||5'i,n||i — ^ 1) ||fi'2,n||i -^ and we can finally define fc„(x) 



1 / \ ^ \^ ~ \\g2,n\\i , I X 

llfl'l,n||l 



(42) 



where || • ||i is the -^^(R) norm. We have that kn is absolutely continuous and 



\\^kn\W,„ 



{k'nix))^ , f ((^l'"(^)niiSfr + /r ^2,n(x))') 



kn[X) 



dx < 



< 



giA^) \il'"':u + g2,ni^) 



dx 



Il9l,n||l 



2(^. (^^idkvJU 



||5l,n||l 



R ginix) II 'h 
^i-,n\ J \\gi,„\\i 



dx + 



^{g'2,ni^)y 



+ g2,n{^) -^R gi,ni^) \\g^^^"\, + g2,ni^. 



dx 



^ , i-\\g2,n\\i^igU^))\ ^ f ^JgU^)? , 

< I — — dx + I r^ — dx 



< 



R Il5'l,n||l gi,n[X 

2(1 - ||5'2,n||l)|| . 1,2 



R g2,n[^) 
2 



lfl'l,n||l 



^^l."lr-l,9i,„+2||A^2,n||ll 



32, n" 



^This is a slight abuse of notation since gi^n are actually sub-probability measures. 
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Hence ||AA;„||^j^^ < oo. For the entropy functional we have: 



\S{K)-S{g 



1,11 



kn{x) \og{kn{x)) dx - / gi^n{x) \og{gi^n{x)) dx 



< 



B(0,Mn-a) 



kn{x) \og{kn{x)) - gi,n{x) \og{gi^n{x)) 



dx 



+ 



+ 



Mn~a<\x\<Mn+a 



(I) 



kn{x) \og{kn{x)) - gi,n{x) \og{gi^n{x)) 



dx 



(11) 



B=(0,M„+a) 



kn{x) \og{kn{x)) - gi,n{x) \og{gi^n{x)) 



dx . 



(Ill) 



We now show that each of the three parts convergence to as n — )■ oo. For the first part: 



(/) 



B{0,M„-a) 

1 ^ ||5'2,n||l 



1 " ||5'2,n||l ,_ _^ /^ ^ /„\ 1 ~ ||5'2,n||l 



lfl'l,n||l 



Ifi'l.nlll 



B(0,M„-a) 



log gi,ni^) 



lfi'l,«l|l 



- giAx)\og{giAx)) 



dx 



gi,nix)^og{gi.a{x)) 



H n n — log 

llfi'l.nlll 



dx 

1 — ||5'2,n||l 

llfi'l.nlll J B{Q,Mn.-a) 



\gi,ri{,x)\ dx — )■ 0. 



For the second part: 

(//) < / \\kn{x)\og{kn{x))\ + \giAx)\og{giAx))\\dx 

J Mn-a<\x\<M„+a ^ ' 

= / |(fl'i,n(a;)— n if-^ + fl'2,n(a;))log(5fi,„(x)— j^ + g2,n{x))\ dx 

J M„-a<\x\<M„+a llfi'l,n||l Il5'l,n||l 

+ / |fi'i,n(a;)log(5fi,„(a;))|(ix. 

Since gi,n{x),g2,n{x) are smaller than - in M„ — a < |a;| < M„ + a, the right hand side converges 
to zero. 

Part (///) is smaller than - by the first property of M„ and therefore it converges to zero. 
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Finally 

r ill II 
\kn{x) - pi{x)\\-^{x)\dx = / |5fi,„(x)— ^ Y-^ + g2,n{^) - Pi{x)\\^{x)\dx 

Jk Il5'l,n||l 

V Il5'l,n||l / A 



1 - llfi'2,n| 



\^{x)\pi{x) dx + \'^{x)\g2,n{x) dx -^ 
Jr 



\\9l,n\\l 

Hence the second property of Proposition 16.21 is satisfied. D 
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